A large and evolving cognate database
Abstract
We present CogNet, a large-scale, automatically-built database of sense-tagged cognates, that is, words of common origin and meaning across languages. CogNet is continuously evolving: its current version contains over 8 million cognate pairs across 338 languages and 35 writing systems, with new releases already in preparation. The paper presents the algorithm and the input resources used for its computation, an evaluation of the result, as well as a quantitative analysis of the data leading to novel insights on language diversity. Furthermore, as an example of the use of large-scale cross-lingual knowledge bases for improving the quality of multilingual applications, we present a case study on bilingual lexicon induction in the framework of transfer learning.
Similar resources
Large-Scale Cognate Recovery
We present a system for the large scale induction of cognate groups. Our model explains the evolution of cognates as a sequence of mutations and innovations along a phylogeny. On the task of identifying cognates from over 21,000 words in 218 different languages from the Oceanic language family, our model achieves a cluster purity score over 91%, while maintaining pairwise recall over 62%.
Armada: a Model for an Evolving Database
Soon we face a common repository size scaling into petabytes, filled with data that needs to be stored and processed. However, the rapidly improving technology cannot keep up with the data growth rate, hence data processing becomes more and more an expensive and time-consuming task. This problem is of major concern, since data processing is a core process for many businesses and applications. Y...
A Literature Review on Evolving Database
Since the 1990s, databases have shown tremendous growth. This growth can be seen in different aspects. The different demands of every era give databases a new set of challenges. To meet those challenges, researchers come up with different ideas and combinations. These various combinations enhance the features of databases, and in this way databases evolve from one period to another. Database that ...
Evolving Database Systems: A Persistent View
Orthogonal persistence ensures that information will exist for as long as it is useful, for which it must have the ability to evolve with the growing needs of the application systems that use it. This may involve evolution of the data, meta-data, programs and applications, as well as the users’ perception of what the information models. The need for evolution has been well recognised in the tra...
Iranian Brain Imaging Database: A Neuropsychiatric Database of Healthy Brain
Introduction: The Iranian Brain Imaging Database (IBID) was initiated in 2017, with 5 major goals: provide researchers easy access to a neuroimaging database, provide normative quantitative measures of the brain for clinical research purposes, study the aging profile of the brain, examine the association of brain structure and function, and join the ENIGMA consortium. Many prestigious databases...
Journal
Journal title: Language Resources and Evaluation
Year: 2021
ISSN: 1574-020X, 1574-0218
DOI: https://doi.org/10.1007/s10579-021-09544-6